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Abstract — The measure of Jensen-Fisher divergence between 
probability distributions is introduced and its theoretical grounds 
set up. This quantity, in contrast to the remaining Jensen 
divergences, is very sensitive to the fluctuations of the proba- 
bility distributions because it is controlled by the (local) Fisher 
information, which is a gradient functional of the distribution. So, 
it is appropriate and informative when studying the similarity of 
distributions, mainly for those having oscillatory character. The 
new Jensen-Fisher divergence shares with the Jensen-Shannon 
divergence the following properties: non-negativity, additivity 
when applied to an arbitrary number of probability densities, 
symmetry under exchange of these densities, vanishing if and only 
if all the densities are equal, and deflniteness even when these 
densities present non-common zeros. Moreover, the Jensen-Fisher 
divergence is shown to be expressed in terms of the relative Fisher 
information as the Jensen-Shannon divergence does in terms of 
the KuUback-Leibler or relative Shannon entropy. Finally the 
Jensen-Shannon and Jensen-Fisher divergences are compared for 
the following three large, non-trivial and qualitatively different 
families of probability distributions: the sinusoidal, generalized 
gamma-like and Rakhmanov-Hermite distributions. 

Index Terms — Jensen divergences, dissimilarity measures, dis- 
crimination information, Shannon entropy. Fisher information. 



I. Introduction 

The study of the measures of similarity between probability 
densities is a fundamental topic in probability theory and 
statistics per se and because of its numerous applications and 
usefulness in a wide variety of scientific fields, including sta- 
tistical physics, quantum chemistry, sequence analysis, pattern 
recognition, diversity, homology, neural networks, computa- 
tional linguistics, bioinformatics and genomics, atomic and 
molecular physics and quantum information. The most popular 
measure of similarity between two probability densities pi (x) 
and P2{x) is possibly the Jensen-Shannon divergence |jTJ, |j2j, 
which is defined as 



JSD[pi,p2]^S 



Pi + P2 



S[pi] + S[p2 



(1) 



where S[p] denotes the Shannon entropy of the density p{x), 
x e A C M, given by 



S[p] 



p{x) hip{x)dx. 



A 



According to Eq. ([T]l, the Jensen-Shannon divergence quanti- 
fies the Shannon entropy excess of a couple of distributions 
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with respect to the mixture of their respective entropies. It can 
also be expressed as 



JSD[pi,p2]=KL 



Pi, 



Pi + P2 



KL 



P2, 



Pi + P2 



indicating that the Jensen-Shannon divergence is a sym- 
metrized and smoothed version of the KuUback-Leibler diver- 
gence (KLD in short) or relative Shannon entropy (also called 
Kullback divergence) defined Q by 

KL[pi,p2] = I In ^-i-^dx. 

The Jensen-Shannon divergence as well as the KLD are non- 
negative and vanish if and only if the two densities are equal 
almost everywhere. Unlike the KLD, the Jensen-Shannon 
divergence has two additional important characteristics: it is 
always well defined (in the sense that it can be evaluated even 
when pi is not absolutely continuous with respect to P2), and 
its square root verifies the triangle inequality so that the square 
root of JSD[pi, P2] is a true metric in the space of probability 
distributions |5|. Furthermore, it admits the generalization to 
several probability distributions |[T| in the following sense: let 
be a vector u: = (wi, W2, ■ ■ ■ ,ojn) and a set of N probability 
densities {pj{x)}^^i; the Jensen-Shannon divergence among 
these probability densities is given by 

JSD^[pi, ...,pn]= S[ujipi H ^ujnPn] 

- ujiS[pi] ojnS[pn], 

where the nonnegative numbers uji > 0, for i — 1, . . . ,N, 
such that J^i' — 1' are weights properly chosen to indicate 
the relative relevance of each density. This is very useful 
for certain applications such as in bioinformatics, diversity 
and atomic physics where there are situations in which it is 
necessary to measure the overall differences of more than two 
probability distributions. Notice that for A = 2 one has 

JSD^[pi,P2] = S[u!ipi +UJ2P2] -UJlS[pi] - W2'5'[P2], 

so that it simplifies to the expression ([T} in the case uji — 

UJ2 = 5- 

This divergence has been extensively applied in numerous 
literary, scientific and technological areas ranging from infor- 
mation theory [l], 1*61, fl], statistical and quantum mechanics 
8) to bioinformatics and genomics |9|, [10|, atomic physics 
Tl[-||T4| and quantum information ^T5|71}T6t- Let us just 
mention that it has been used as a tool to study EEG records 
p7|, to segment symbolic sequences p8) , to measure the 
complexity of genomic sequences |j9), pO]7to analyze literary 
texts and musical score |19J, to quantify quantum phenomena 



such as entanglement and decoherence |15|, |16| and to 
understand the complex organization and shell-filhng patterns 



2 



of the many-electron systems all over the periodic table of 
chemical elements fTTl-fTTl. 

Nevertheless, the Jensen-Shannon divergence is, at times, 
weakly informative or even uninformative, mainly because 
it depends on a quantity of global character (the Shannon 
entropy) in the sense that it is hardly sensitive to the local 
fluctuations or irregularities of the probability densities. So, 
by definition, this divergence has serious defects to compare 
probability densities with highly oscillatory character. This is 
often the common situation in many fields, such as e.g. in 
the quantum-mechanical description of natural phenomena. To 
illustrate it, let us consider the simple case of the motion of 
a particle-in-a-box (i.e., in the infinite well V{x) — 0, for 
< X < 1, and +oo elsewhere) (2U^. The stationary states 
of the particle are characterized by the sinusoidal probability 
densities 

Pn{x) = 2sin^(7rna::);a; € (0,1), (2) 

and pn{x) — when x ^ (0, 1), where n ~ 1,2, . . . indicates 
the energetic level and label of the state. The divergence 
between the n-th quantum state pn{x) and the ground state 
pi{x) is studied in Figure [T| by means of the Jensen-Shannon 
measure JSD[pn, pi] ■ We observe that this divergence tends 
rapidly to a constant, so that it is not informative enough 
about the enormous differences between these two probability 
densities. 

The case of a particle-in-a-box and other cases pointed out 
later show the necessity for defining a new divergence to be 
able to measure the similarity between two or more oscillating 
probability densities in a much more appropriate quantitative 
form. This is the purpose of our work: to introduce the 
Jensen-Fisher divergence, which depends on an information- 
theoretic quantity (the Fisher information [21 1, |22|) with a 
locality property: it is very sensitive to fluctuations of the 
density because it is a gradient functional of it. This is done 
in Section |ll] where the definition of the new divergence is 
given and its main properties are shown. Then, in Section 



III the Jensen-Shannon and Jensen-Fisher divergences are 
compared in the framework of an information theoretic plane 
for various cases properly chosen to illustrate the relative 
advantages and disadvantages of these two quantities; namely, 
the sinusoidal, generalized gamma and Rakhmanov-Hermite 
probability distributions. Finally, some conclusions and open 
problems are given. 

II. The Jensen-Fisher divergence measure 

In this Section we define a new Jensen divergence between 
probability distributions based on the Fisher informations of 
these distributions, and we study its main properties. In doing 
so, we follow a line of research similar to that of Lin ||T| to 
derive the Jensen-Shannon divergence. 

Let X he a continuous random variable with probability 
density p{x), x G A C M. The (translationally invariant) 
Fisher information of p{x) is given pTj, p2) by 



J A 



dx 



hip{x) 



dx. 



(3) 



and the relative Fisher information between the probability 
densities pi{x) and p2{x) is defined | |23) by the directed 
divergence 



Frcl[pi,P2] = / Plix) 
J A 



dx P2{x) 



dx. 



It is known that this quantity is non-negative and additive 
but non symmetric. The relative symmetric measure defined 
by 



G[pi,P2] = F^cl[Pl,P2] + Frcl[P2,Pl] 
{pi{x) + P2{x)) 



dx P2{x) 



dx. 



is called Fisher divergence f23l, which has been recently used 
in some applications to study the complexity and shell orga- 
nization of the atomic systems along the Periodic Table p2) , 
|14|. As in the Shannon case, this divergence is nonnegative 
and it vanishes if and only if px{x) = P2{x) for any x e A, 
but it is undefined unless that pi{x) and P2{x) be absolutely 
continuous with respect to each other. 

To overcome these problems of the Fid and G divergences, 
we define a new directed divergence between the probability 
densities pi{x) and P2{x) as 



Frc\ [Pl,P2] 



Pi{x) 



A 



dx 



In 



Pi{x) 



Pl{x)+P2(x)) 

2 



dx. 



This quantity is nonnegative because it vanishes if and only 
if pi{x) — p2{x) for any a; e A, and it is well defined even 
when both densities have non-common zeros. In addition, it 
can be expressed in terms of the relative Fisher information 



Frcl[Pl,P2] = F,c 



Pi, 



Pi + P2 



However, it is nonsymmetric. To avoid this problem we 
propose the following symmetrized form 



JFD[pi,p2] = Frcl [pi,P2] + F,.cl [p2,Pl] , (4) 



as a new measure, which we call Jensen-Fisher divergence 
between the probability densities pi{x) and P2{x). From Eqs. 
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Q and ([3]l, we have that 



P2{x) 




Frcl 



P2, 



Pi + P2 



Pl(x)+P2(x) 

2 



dx 



Pl{x)+p2{x} 

2 



dx 



P'ljx) _ p'ljx) + p'^jx) " 

Pl{x) pi{x)+p2{x)^ 



dx 



dx 



P2{x) _ P'ljx) + p'^jx) 
P2{x) pi{x) + P2{x) 



Plix)+P2ix) 



ip2ix)f „ , . , p'lix)+p'2{x) 

ZP2[X) / N , ^ N 

P2{X) pi[x)+p2[X) 

, . Mi^)+p'2i-)r \ 



jp'iix))' , {p'2{x)? M^)+P2i^)y 

Pl{x) P2{x) pi{x)+p2{x) 

, {P'l{x)+p'2{x)f 



Pl{x)+ P2{x) 
{P'lix)? , {P'2ix)? ip[ix)+p'2ix)y 



Pl{x) P2{x) 



Pl{x)^' P2{x) 



SO that the Jensen-Fisher divergence can be expressed in terms 
of the Fisher information as 



Pi + P2 



(5) 



which is similar to the expression ([T]) of the Jensen-Shannon 
divergence in terms of the Shannon entropy, save for a global 
minus sign. 

It is important to remark that the Jensen-Fisher divergence 
we have just introduced, shares the following properties with 
the Jensen-Shannon divergence. First, it is nonnegative be- 
cause of Eq. (jSj) and the convexity of the Fisher information 
which leads to 



F[pi]+F[p2] 



> F 



Pi + P2 



Second, it vanishes if and only if the two involved densities 
are equal almost everywhere in the interval A. This comes 
again from the fact that 



F[pi] + F[p2] 



= F 



Pi + P2 



Pi = P2, 



by keeping in mind the convexity of the Fisher functional; so 
that, 

JFD[pi,p2] = pi = p2- 



Third, it is symmetric because one can straightforwardly 
prove that JFD[pi, P2] — JFD[p2, pi]- Fourth, it is well 
defined when pi{x) or P2{x) are not absolutely continuous 
with respect to each other (of course, as long as the Fisher 
information of each density is well defined) and so, it can 
be used to compare probability distributions with no common 
zeros. 

In addition, the Jensen-Fisher and Jensen-Shannon diver- 
gences satisfy the following deBruijn-type expression 



de 



JSD[pi + Vepc/Oi + \/epG] 



1 



JFD[pi,P2], 



c=0 



(6) 

where pQ is a normal distribution with zero mean and variance 
equal to one. This can be proved by considering the original 
deBruijn's |24| identity between the Shannon entropy and the 
Fisher information: 



\f[p]. 



(7) 



Then, 



— JSD[pi + yrtpG,P2 + \/ePG] 



dt 



s 



£=0 

Pi + y/^PG + P2 + \/iPG 



E=0 



1 d 

2 d~e 



S[pi + ^/epG] 



e=0 
Pi + P2 



1 d 

2 & 



S[pi + V^pg] 



1 d 

2 d~e 

+ V^PG 
1 d 



S[P2 + V^Pg] 



e=0 



2 de 



e=0 

S[P2 + V^Pg] 



Taking into account the deBruijn's identity (|7]l, we obtain 

d 



de 



JSD[pi + ^/e/9G,P2 + n/epg] 



e=0 



2 



Pl + P2 

2 



-\f[p^]-\f[p2] = -\jfd[p,,p2], 



and the identity (j6]l is proved. 

Furthermore, like the Jensen-Shannon divergence, it ad- 
mits a generalization to N densities with different weights 
a; = (wi, clI2, ■ • ■ , w^v), where oj^ > 0, i = 0, 1, . . . , A^, and 



2Zi=l^^ = 1' 

JFD^[pi, 



,PnI 



0JiF[pi] + --- 
~ F[uJiPi{x) 



- ujnS[pn] 

h ujnPn{x)]. 



Finally, let us highlight that the Jensen-Fisher divergence 
is informative even in those cases where the Jensen-Shannon 
is not. This is illustrated in Figure [T| for the particle-in-a- 
box system, whose stationary quantum-mechanical states are 
described by the probability densities (j2]). Therein, we have 
depicted the Jensen-Fisher and Jensen-Shannon divergences 
between the nth-state density pn{x) and the ground state 
Pi{x), given by JFD[pn, pi] and JSD[pn, pi] respectively, in 
terms of n when n is going from 1 to 50. It turns out that, as n 
increases, the Jensen-Fisher divergence increases much more 
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Fig. 1. Jensen-Shannon JSD[p„, pi] (□) and Jensen-Fisher JFD[p„,pi] 
(0) divergences between the sinusoidal densities Pn{x) and pi{x) (see Eq. 
j2j) in terms of the quantum number n. 
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Fig. 2. JSD[pn,pio] — JFD[pn,pio] divergence plane of the sinusoidal 
densities pn{x) (see Eq. |2j) for n = 1, . . . , 50. 



than the Jensen-Shannon, which remains practically constant. 
This clearly indicates that the former divergence is much more 
informative than the latter. 

III. Jensen-Shannon and Jensen-Fisher 
divergences: Mutual comparison 

In this Section we compare the Jensen-Fisher and the 
Jensen-Shannon divergences in the information-theoretic 
JFD — JSD plane for the following three large, qualitatively 
different families of probability distributions: the sinusoidal 
distributions defined by Eq. the generalized gamma-like 
distributions (see Eq. dSll below) and the Rakhmanov-Hermite 
distributions (see Eq. ^ below). 

A. Sinusoidal densities 

These probability densities given by Eq. (|2]i have been used 
to describe various physical systems, such as e.g. the stationary 
quantum-mechanical states of a particle-in-a-box (i.e., in an 
infinite potential well) |20| as already mentioned. Indeed, 
they characterized the ground state pi{x) and the excited 
states Pnix), with n = 2,3, . . ., of this quantum mechanical 
system. In Figure [T] previously discussed, we have shown that 
the excitation of the particle is described in a much better 
information-theoretical way by the JFD than by the JSD, 
since the former divergence between the probability density 
of the nth-excited-state and the ground state increases when n 
(so, when the energy of the particle) is increasing, while the 
JSD remains practically constant. 

In Figure [2] we have depicted the Jensen-Shannon diver- 
gence JSD[pn, pio] between the excited states with quantum 
number n and 10 against the corresponding Jensen-Fisher 
divergence JFD[pn,pio] for n — 1,...,50. The resulting 
values (points) obtained for increasing n are joined by a 
line to guide the eye. It is observed that the JSD remains 
constant except for some points. They correspond to values of 
n multiple and sub multiple of 10, that is the quantum number 
of the reference state piQ. At these points, Pn{x) and pio{x) 
share a number of zeros, so these densities become more 
similar to each other, and both JSD and JFD achieve a lower 



value. Less dramatic deviations are observed also for values of 
n — 15, 25, 35, . . ., where the density pn{x) has some of the 
zeros of pio{x). From a quantum-mechanical point of view, 
the particles on those states share some common forbidden 
regions (or also some common maximum probability regions). 
The behaviours of the JSD and JFD measures on this plane 
shows that although both quantities are sensitive to the overlap 
of the zeros, the JFD highlights this phenomenon much better 
(please, be aware of the different scaling in the axes of the 
figure). Moreover, the JFD presents larger absolute variations 
and has a much wider range of variation than the JSD along 
all the pairs of states considered. 

B. Generalized gamma-like densities 

In contrast with the previous case (where the densities have 
several zeros in a finite interval), here we consider a family of 
one-parameter densities having at most one zero and defined 
in the whole real line; namely, the gamma-like densities given 
by 

^x) ^ (V22h (^))" i^i'^^p (-y) ' ^ > 1' 

(8) 

so that for /? = 0, one has a normal distribution: 

= 7o(x) = exp (^^y) ■ 

In what follows we assume that f3 > 1 because the Fisher 
information (|3]l is not defined for < /3 < 1, having a vertical 
asymptote at /3 = 1. 

We have done two different analyses. First, in Figure |3] 
the values of [7/3, 7] and JS'£'[7^,7] are given as a 

function of (3. It shows that the Jensen-Fisher divergence is 
much more sensitive to the multiplicity of the zero than the 
Jensen-Shannon divergence. While the former varies along a 
range of six orders of magnitude, the latter only varies along 
one order of magnitude. 

Second, Figure|4]shows the JSD[-f, jp] — JFD[f, 7^] plane 
between the probability densities 7(2;) and ^p{x) for all values 
P from 1 to 80. Notice that there are two regimes, one for (3 < 
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Fig. 3. Jensen-Shannon J5Z)[7^,7] (dashed hne) and Jensen-Fisher 
JFD[7^,7] (solid line) divergences between the generalized gamma den- 
sities 'yp{x) and 'y{x) (see Eq. |8|) as functions of the multiplicity parameter 
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Fig. 4. JSD [7^3,7] — J_F_D[7^,7] divergence plane of the generalized 
gamma densities 7^ (x) and 7(2:) (see Eq. jsj) as functions of the multiplicity 
parameter /3. 



I and /? > 14 where the JSD remains almost constant and 
the JFD varies rapidly, and another for | < /3 < 14 where 
the JFD remains almost constant while the JSD varies. As 
in the previous example, the range of variation of the JFD is 
much wider than that of JSDY^^^p]. 



C. Rakhmanov-Hermite densities 

Let us now consider the class of Rakhmanov-Hermite 
probability densities defined by 

1 



(9) 



where Hn{x) is the orthogonal Hermite polynomial of degree 
n. As for the quantum infinite well previously discussed, 
the parameter n = 0,1,2,... indicates the energetic level 
and labels the corresponding state. They have been shown to 
correspond to the quantum-mechanical probability densities 
of the ground and excited stationary states of the isotropic 
harmonic oscillator (HO, in short); see e.g. |25|, |26|. 



has an increasing sawtooth behaviour while 
p^iP] firstly decreases down to zero when n 



Here we have done three analyses. Firstly, we depict in 
Figures [3] and [6] the Jensen-Fisher and Jensen-Shannon di- 
vergences, respectively, between the nth-density and each of 
the reference probability densities with rij. = 0, 10 and 40; 
this is to say the quantities JFZ?[/5j^*-', *-'] (dotted line), 
JFDIp^'^.P^^] (solid line) and JFD[p^°,p^^] (dashed 
line) and the corresponding JSDs. We observe from the 
comparison of the dotted lines of the two figures that both 
divergences between the nth-state density p^'~'{x) and the 
ground-state density pa{x) have a increasing behaviour in 
terms of the quantum number n as one should expect. More- 
over, from the comparison of the solid lines of the two figures, 
we realize an opposite behaviour in the two divergences be- 
tween the nth-state density p^'-'lx) and the lOth-state density 
PuP(^) when the quantum number n (which controls the 
number of zeros of the density) is increasing; namely, the 
JFD[p^'=> 
the JSD[p^° 

goes from to the reference number 10, and then increases 
when n goes from 10 upwards. A similar trend is observed 
from the comparison of the dashed lines of the two figures 
for the JF£)[<o, pgp] and JSDip^'^, pf^] divergences but 
now with respect to the reference number 40. Clearly, in 
the three cases (n, n^) = (n, 1), (n, 10) and (n,40) the 
Jensen-Fisher divergence has always higher variations than the 
Jensen-Shannon divergence, because the JFD has a stronger 
sensitivity than the JSD to the increasing oscillatory character 
of pn{x) when n is increasing. In addition we observe that 
the Jensen-Fisher divergence presents two maxima around the 
reference value (maxima at 9 and 11 for n = 10, and at 39 
and 41 for n — 40) that can be explained taking into account 
the relative position of the zeros of the two involved densities. 
In those cases each zero of one of the densities is situated 
between two zeros of the other density, so none of the zeros 
of one of the densities are near the zeros of the other one. The 
opposite situation occurs for the local minima that appears in 
the graphics, where some zeros of a density are near the zeros 
of the other one. The Jensen-Shannon divergence also shows 
the latter feature but with much less intensity. However, it does 
not show the local maxima around the reference value. 

Our second analysis is shown in Figure |7] where we study 
the comparison of the JSD and JFD divergences between 
the pairs of probability densities with quantum numbers (n, 0), 
(n, 10) and (n, 40) in the frame of the JSD - JFD di- 
vergence plane. This figure combines the results contained 
in the two previous Figures |5] and |6] and shows again the 
overall increasing behaviour of the Jensen-Fisher divergence 
and its much higher values, in contrast to the Jensen-Shannon 
divergence. The most important feature that this Figure shows 
is the separation of the clouds of points in the direction of 
increasing JFD, while these clouds are not distinguishable 
from their JSD values. Let us mention that the two couples 
of points to the right of the vertices of the V-shaped structures, 
correspond to the local maxima that appear in Figure [5] 

Finally, in Figure |8] we use again the JSD — JFD plane 
as a tool to simultaneously show the distance or divergence 
of the pairs of probability densities with quantum numbers 
(n, n + 1), (n,n + 10), (n, 2n), (n, 2n + 10), (n, 3n) and 
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Fig. 5. Jensen-Fisher divergences J F D{p^'-^ , p^'~'] (dotted line), 
JFD[pHO pHO] (soijjj ling) and JFD[pHO pHp] (daslied line) between 
the nth-excited state and the ground state, lOt/i and 40t/i-excited states of the 
isotropic harmonic oscillator, respectively, in terms of the quantum number 
n. 



Fig. 8. JSD — JFD divergence plane of the isotropic harmonic oscillator 
for the pairs of stationary states (n, rrt) = (n, n + l) (□), (n, n + 10) (0), 
(n, 2n) (x), (n, 2n+10) (A), (n, 3n) (+) and (n, 4?i) (*), for several values 
of n from n = up to a value of the JFD of 240. 
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Fig. 6. Jensen-Shannon divergences J S D[p^'~' , p^'~'] (dotted line), 
JSD[pHO, pHO] (sQjijj ling) j^nd J5D[pHO pHO] (dashed line) between 
the nth-excited state and the ground state, 10th and 40t/i-excited states of 
the isotropic harmonic oscillator, respectively, in terms of the quantum number 
n. 
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Fig. 7. JSD — JFD divergence plane of the isotropic harmonic oscillator 
for the pairs of stationary states (n, m) = (n, 0) (□), (n, 10) (0) and (n, 40) 
(x) when n varies from to 100. 



(rt,4rt). Here we notice that, as n increases, the JFD tends 
to infinity in all the cases, but the JSD tends to a constant. 
We observe that J S D[p^'^ , p^2i\ tends to the same value 
as JSD[pf^,p^%l and JSD[p^°,p^^] tends to the same 
value as JS'Z?[p^'-*. p^^j^g], being those two limiting values 
different from each other Then, we can conclude that this 
asymptotic value of the JSD depends on the relative spread- 
ing of the involved densities. When n tends to infinity, the 
spreading of the density p^'^{x) converges to that of p^^^{x) 
or p^9t^q[x). However, p^'~'{x) is less spread than p^°(a;) 
or P2n+w{^)- Thus, the JSD between those densities tends to 
a different value. This trend is confirmed by the asymptotic 
values of JSD[p^^,p^^] and J S D[p^^ , p^^]. 

As in previous analyses. Figure [8] shows that the JFD has a 
much wider range of variation than the JSD, so that it allows 
us to discriminate between different values of n in a better 
way. However, contrary to what happened in Figure |7j the 
JFD cannot distinguish between the different clouds of points 
of Figure |8] This is a clear illustration of the complementarity 
of both the JSD and JFD when analysing the similarity of 
probability distiibutions. 

IV. Conclusions and open problems 

In this paper the Jensen-Fisher divergence measure is intro- 
duced and its theoretical grounds are shown. In summary, we 
find that the main properties (non-negativity, additivity, sym- 
metry, vanishing, definiteness, deBruijn-like identity) of the 
Jensen-Shannon divergence are shared by the new divergence. 
Moreover, the Jensen-Fisher divergence is applied to three 
large families of representative probability distributions (sinu- 
soidal, gamma-like and Rakhmanov-Hermite distributions) and 
compared with the Jensen-Shannon divergence. Our results 
illustrate that, although both JSD and JFD divergences 
are complementary in the sense that they are sensitive to 
different aspects of the probability distributions, the latter is 
more informative when studying the similarity of oscillating 
densities. 
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Finally we should immediately point out three open issues. 
First, the square root of JSD is known to define a metric 
lj5J, [27 J . Does the Jensen-Fisher divergence defines another 
distance metric for probability distributions beyond the JSD 
l|5)-||7)' p7 ) and the variational distance |7|?. This is still 
an open problem which deserves much attention per se and 
because of its so many implications in numerous scientific and 
technological fields. Second, some generalizations of the JSD 
have been recently introduced such as the Jensen-Renyi [28J 
and Jensen-Tsallis p8), |29|, pO) divergences as well as the 



Jensen divergences of order a \3V\ paying the price of the loss 
of certain interesting properties but gaining more flexibility 
because they have a new degree of freedom provided by its 
parameter q or a, what is very useful in numerous applications 
(see e.g., p2)-p6J). Does the Jensen-Fisher divergence admits 
any generalization?. The answer is yes but this avenue is still 
to be paved. Finally, does there exist a quantum version of 
the JFD based on the quantum Fisher information |37|, [38| 
similarly to the quantum JSD based on the von Neuman 
enti-opy ||27l, ||3T1, ||39)-||42)? 
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